Picture for Tong Zhang

Tong Zhang

Nanjing University of Science and Technology, Nanjing, China

Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL

Add code
May 05, 2025
Viaarxiv icon

RM-R1: Reward Modeling as Reasoning

Add code
May 05, 2025
Viaarxiv icon

Multi-Step Consistency Models: Fast Generation with Theoretical Guarantees

Add code
May 02, 2025
Viaarxiv icon

MIMIC-\RNum{4}-Ext-22MCTS: A 22 Millions-Event Temporal Clinical Time-Series Dataset with Relative Timestamp for Risk Prediction

Add code
May 01, 2025
Viaarxiv icon

Capturing Conditional Dependence via Auto-regressive Diffusion Models

Add code
Apr 30, 2025
Viaarxiv icon

STCL:Curriculum learning Strategies for deep learning image steganography models

Add code
Apr 24, 2025
Viaarxiv icon

CLPSTNet: A Progressive Multi-Scale Convolutional Steganography Model Integrating Curriculum Learning

Add code
Apr 23, 2025
Viaarxiv icon

MMHCL: Multi-Modal Hypergraph Contrastive Learning for Recommendation

Add code
Apr 23, 2025
Viaarxiv icon

EarthGPT-X: Enabling MLLMs to Flexibly and Comprehensively Understand Multi-Source Remote Sensing Imagery

Add code
Apr 17, 2025
Viaarxiv icon

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

Add code
Apr 15, 2025
Viaarxiv icon